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Abstract 

We present the first fully dynamic worst case I/O-emcient data structures that support planar orthogonal 
3-sided range skyline reporting queries in 0(log 2B e n+ B /_ e ) I/Os and updates in 0(log 2BE n) I/Os, using 
0( B "_ e ) blocks of space, for n input planar points, t reported points, and parameter < e < 1. We obtain 
the result by extending Sundar's priority queues with attrition to support the operations DeleteMin 
and CatenateAndAttrite in 0(1) worst case I/Os, and in 0(1/B) amortized I/Os given that a 
constant number of blocks is already loaded in main memory. Finally, we show that any pointer-based 
static data structure that supports dominated maxima reporting queries, namely the difficult special case 
of 4-sided skyline queries, in ©(log ' 1 ' n+t) worst case time must occupy ^( n i ^g n ) space, by adapting 
a similar lower bounding argument for planar 4-sided range reporting queries. 



1 Introduction 

We study the problem of maintaining a set of planar points in external memory subject to insertions and 
deletions of points in order to support planar orthogonal 3-sided range skyline reporting queries efficiently 
in the worst case. For two points p,q £ R d , we say that p dominates q, if and only if all the coordinates of p 
are greater than those of q. The skyline of a pointset P consists of the maximal points of P, which are the 
points in P that are not dominated by any other point in P. Planar 3-sided range skyline reporting queries 
that report the maximal points among the points that lie 

Skyline computation has been receiving increasing attention in the field of databases since the introduction 
of the skyline operator for SQL [3 . Skyline points correspond to the "interesting" entries of a relational 
database as they are optimal simultaneously over all attributes. The considered variant of planar skyline 
queries adds the capability of reporting the interesting entries among those input entries whose attribute 
values belong to a given 3-sided range. Databases used in practical applications usually process massive 
amounts of data in dynamic environments, where the data can be modified by update operations. Therefore 
we analyze our algorithms in the 1/ O model pQ , which is commonly used to capture the complexity of massive 
data computation. It assumes that the input data resides in the disk (external memory) divided in blocks 
of B consecutive words, and that computation occurs for free in the internal memory of size M words. An 
I/O-operation (I/O) reads a block of data from the disk into the internal memory, or writes a block of data 
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to the disk. Time complexity is expressed in number of I/Os, and space complexity in the number of blocks 
that the input data occupies on the disk. 

Previous Results Different approaches have been proposed for maintaining the d-dimensional skyline in 
external memory under update operations, assuming for example offline updates over data streams |19l 113) . 
only online deletions |20j . online average case updates |16j . arbitrary online updates [8] and online updates 
over moving input points [9] ■ The efficiency of all previous approaches is measured experimentally in terms 
of disk usage over average case data. However, even for the planar case, no 1/O-efficient structure exists that 
supports both arbitrary insertions and deletions in sublinear worst case I/Os. Regarding internal memory, 
Brodal and Tsakalidis |3] present two linear space dynamic data structures that support 3-sided range skyline 
reporting queries in 0(logn + t) and O ( ^"^g n +0 worst case time, and updates in O(logn) and O ( ^° fj^ - ) 
worst case time in the pointer machine and the RAM model, respectively, where n is the input size and t is 
the output size. They also present an O(n\ogn) space dynamic pointer-based data structure that supports 
4-sided range skyline reporting queries in C(log 2 n + t) worst case time and updates in O(log 2 n) worst case 
time. Adapting these structures to the I/O model attains 0(\og®^ n + t) query I/Os, which is undesired 
since 0(1) I/Os are spent per reported point. 

Regarding the static variant of the problem, Sheng and Tao [17] obtain an I/O-efficient algorithm that 
computes the skyline of a static d-dimensional pointset in 0(-glogi7 2 it) worst case I/Os, for d > 3, by 

adapting the internal memory algorithms of (XSJ E] to external memory. C(§ logj« -g) I/Os can be achieved 
for the planar case. There exist two O(nlogn) and 0{n y ^°^ n ) space static data structures that support 
planar 4-sided range skyline reporting queries in 0(log7i + t) and O ( l( ^°f^ n + t) worst case time, for the 
pointer machine and the RAM, respectively p~0| 17]. 

Our Results In Section [3] we present the basic building block of the structures for dynamic planar range 
skyline reporting queries that we present in Section[4j That is pointer-based I/O-efficient catenable priority 
queues with attrition (I/O-CPQAs) that support the operations DeleteMin and CatenateAndAttrite 
in 0(1/ B) amortized I/Os and in 0(1) worst case I/Os, using 0( B -g R ) disk blocks, after n calls to Catenate- 
AndAttrite and m calls to DeleteMin. The result is obtained by modifying appropriately a proposed 
implementation for priority queues with attrition of Sundar |18j . 

In Section [4] we present our main result, namely 1/O-efficient dynamic data structures that support 3- 
sided range skyline reporting queries in C(log 2B e n+ B i- e ) worst case I/Os and updates in 0(log 2 Be n) worst 
case I/Os, using 0( B "_ e ) blocks, for a parameter < e < 1. These are the first fully dynamic skyline data 
structures for external memory that support all operations in polylogarithmic worst case time. The results 
are obtained by following the approach of Overmars and van Leeuwen |14) for planar skyline maintainance and 
utilizing conflucntly persistent I/O-CPQAs (implemented with functional catenable deques [H]). Applying 
the same methodology to internal memory pointer-based CPQAs yields alternative implementations for 
dynamic 3-sided reporting in the pointer machine in the same bounds as in [3]. 

Finally, in Section [5] we prove that any pointer-based static data structure that supports reporting the 
maximal points among the points that are dominated by a given query point in 0(\og ^ n) worst case 
time must occupy ^■( , n ^°f t ^ n ) space, by adapting the similar lower bounding argument of Chazelle [5] for 
planar 4-sided range reporting queries to the considered dominated skyline reporting queries. These queries 
are termed as dominating minima reporting queries. The symmetric case of dominated maxima reporting 
queries is equivalent and comprises a special case of rectangular visibilty queries [15] and 4-sided range 
skyline reporting queries [H UH]- The result shows that the space usage of the pointer-based structures 
in [15l IU [10] is optimal within a O (log log n) factor, for the attained query time. 

2 Preliminaries 

Priority Queues with Attrition Sundar [18] introduces pointer-based priority queues with attrition 
(PQAs) that support the following operations in 0(1) worst case time on a set of elements drawn from a total 
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order: DeleteMin deletes and returns the minimum element from the PQA, and InsertAndAttrite(c) 
inserts element e into the PQA and removes all elements larger than e from the PQA. PQAs use space linear 
to the number of inserted elements minus the number of elements removed by DeleteMin. 

Functional Catenable Deques A dynamic data structure is persistent when it maintains its previous 
versions as update operations are performed on it. It is fully persistent when it permits accessing and 
updating the previous versions. In turn, it is called confluently persistent when it is fully persistent, and 
moreover it allows for two versions to be combined into a new version, by use of an update operation that 
merges the two versions. In this case, the versions form a directed acyclic version graph. A catenable deque 
is a list that stores a set of elements from a total order, and supports the operations Push and Inject that 
insert an element to the head and tail of the list respectively, Pop and Eject that remove the element from 
the head and tail of the list respectively, and Catenate that concatenates two lists into one. Kaplan and 
Tarjan present purely functional catenable deques that are confluently persistent and support the above 
operations in 0(1) worst case time. 

Searching Lower Bound in the Pointer Machine In the pointer machine model a data structure that 
stores a data set S and supports range reporting queries for a query set Q, can be modelled as a directed 
graph G of bounded out-degree. In particular, every node in G may be assigned an element of S or may 
contain some other useful information. For a query range Qi € Q, the algorithm navigates over the edges of 
G in order to locate all nodes that contain the answer to the query. The algorithm may also traverse other 
nodes. The time complexity of reporting the output of Qi is at least equal to the number of nodes accessed 
in graph G for Qi. To prove a lower bound we need to construct hard instances with particular properties, 
as discussed by Chazelle and Liu [SJ|5]. In particular, they define the graph G to be (a, w)- effective, if a 
query is supported in a(t + Ui) time, where t is the output size, a is a multiplicative factor for the output 
size (a = 0(1) for our purposes) and w is the additive factor. They also define a query set Q to be (m, u>)- 
favorable for a data set S, if \S n Q»j > w,VQj € Q and \S n l~l • • • l~l Q lm \ = 0(1), V«i < h ■ ■ ■ < i m . 
Intuitively, the first part of this property requires that the size of the output is large enough (at least u) so 
that it dominates the additive factor of lu in the time complexity. The second part requires that the query 
outputs have minimum overlap, in order to force G to be large without many nodes containing the output 
of many queries. The following lemma exploits these properties to provide a lower bound on the minimum 
size of G. 

Lemma 2.1. Q Lemma 2.3] For an {m,uj) -favorable graph G for the data set S, and for an (a, uj) -effective 
set of queries Q, G contains 0,(\Q\u>/m) nodes, for constant a and for any large enough u>. 

3 I/O-Efficient Catenable Priority Queues with Attrition 

In this Section, we present I/O-cfficient catenable priority queues with attrition (I/O-CPQAs) that store a 
set of elements from a total order in external memory, and support the following operations: 

FindMin(Q) returns the minimum element in I/O-CPQA Q. 

DeleteMin(Q) removes the minimum element e from I/O-CPQA Q and returns element e and the new 
I/O-CPQA Q' = Q\{e}. 

Catenate And Attrite (Q i, Q 2 ) concatenates I/O-CPQA Q 2 to the end of I/O-CPQA Q\, removes all 
elements in Qi that are larger than the minimum element in Q 2 , and returns a new I/O-CPQA 
Q\ = {e E Qi|e < min(<5 2 )} U Q 2 . We say that the removed elements have been attrited. 

InsertAndAttrite(Q, e) inserts element e at the end of Q and attrites all elements in Q that are larger 
than the value of e. 
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All operations take 0(1) worst case I/Os and 0(l/b) amortized I/Os, given that a constant number of 
blocks is already loaded into main memory, for a parameter 1 < b < B. To achieve the result, we modify an 
implementation for the PQAs of Sundar [18]. 

An I/O-CPQA Q consists of fcg + 2 deques of records, called the clean deque C(Q), the buffer deque B(Q) 
and the dirty deques Di(Q), . . . ,Dk Q (Q), where kq > 0. A record r — (l,p) consists of a buffer I of [b,4b] 
elements of strictly increasing value and a pointer p to an I/O-CPQA. The ordering of r is; first all elements 
of I and then all elements of the I/O-CPQA pointed to by p. We define the queue order of Q to be C(Q), B(Q) 
and -Di(Q), . . . , Dk Q (Q)- A record is simple when its pointer p is null. The clean deque and the buffer deque 
only contains simple records. See Figure [I] for an overview of the structure. 

C(Q) B(Q) D 1 {Q) Dfcg-i(O) D kQ (Q) 

Figure 1: A I/O CPQA Q consists of k Q + 2 deques of records; C(Q),B(Q),Di(Q), . . .,D kQ {Q). The records 
in C(Q) and B(Q) are simple, the records of Di(Q), . . . , D kQ (Q) may contain pointers to other I/O CPQA's. 
Gray recordsare always loaded in memory. 

Given a record r = (l,p) the minimum and maximum elements in the buffers of r, are denoted by min(r) = 
min(Z) and max(r) = max(Z), respectively. They appear respectively first and last in the queue order of I, 
since the buffer of r is sorted by value. Henceforth, we do not distinguish between an element and its value. 
Given a deque q the first and the last record is denoted by first (q) and last(g), respectively. Also rest(g) 
denotes all records of the deque q excluding the record first(g). Similarly, front(g) denotes all records for 
the deque q excluding the record last(q). The size \r\ of a record r is defined to be the number of elements 
in its buffer. The size |g| of a deque q is defined to be the number of records it contains. The size \Q\ of 
the I/O-CPQA Q is defined to be the number of elements that Q contains. For an I/O-CPQA Q we denote 
by first (Q) and last(Q), the first and last of the records in C(Q), B(Q), D\(Q), . . . , D kQ (Q) that exists, 
respectively. By middle(Q) we denote all records in Q and the records in the I/O-CPQAs pointed by Q, 
except for records first (Q) and last(Q) and the I/O-CPQAs they point to. We call an I/O-CPQA Q large 
if \Q\ > b and small otherwise. The minimum value of all elements stored in the I/O-CPQA Q is denote 
by min(<5). For an I/O-CPQA Q we maintain the following invariants: 

1.1) For every record r = (l,p) where pointer p points to I/O-CPQA Q' , max(7) < mm(Q') holds. 

1.2) In all deques of Q, where record r± — (h,pi) precedes record r<i = (^2,^2), max(Zi) < min^) holds. 

1.3) For the deques C{Q),B{Q) and Di(Q), max(last(C(Q))) < min(first(fl(Q))) < min(first(D 1 (Q))) 
holds. 

1.4) Element min(first(_Di((3))) has the minimum value among all the elements in the dirty deques Di(Q), . . . , Dk(Q). 

1.5) All records in the deques C(Q) and B(Q) are simple. 

1-6) \C{Q)\>Yllii\Di{Q)\+kQ-l. 

1.7) |first(C(Q))| < b holds, if and only if \Q\ < b holds. 

1.8) |last(Dfe Q (Q))| < b holds, if and only if record last(Dfc Q (Q)) is simple. In this case \r\ £ [b,5b] holds. 

From Invariants l|2j l{3]and l|4j we have that the minimum element min(Q) stored in the I/O-CPQA Q is 
element min(first(C(Q))). We say that an operation improves or aggravates by a parameter c the inequality 
of invariant I j6] for I/O-CPQA Q, when the operation increases or decreases A(Q) = \C(Q)\ — J2i=i \Di{Q)\ — 
kg + 1 by c, respectively. To argue about the 0(1/6) amortized I/O bounds we define the following potential 
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functions for large and small I/O-CPQAs. In particular, for large I/O-CPQAs Q, the potential Q(Q) is 
defined as 

*(Q) = $ F (|nrst(Q)|) + |middlc(Q)| + $ L (|last(Q)|), 

where 

f 3- f , b< x <2b ( 
*,(*)=< 1, 2 ^<3 and $ L (x) = | ^ J - < 56 

( f-5, 36 < z < 46 L 6 ~~ 

For small I/O-CPQAs Q, the potential ^(Q) is defined as 




The total potential $t is defined as 

$ T = ^$(Q)+ ^ 1, 

Q Q|6<|Q| 

where the first sum is over all I/O-CPQAs Q and the second sum is only over all large I/O-CPQAs Q. 
3.1 Operations 

In the following, we describe the algorithms that implement the operations supported by the I/O-CPQA Q. 
The operations call the auxiliary operation Bias(Q), which will be described last, that improves the inequality 
of invariant lj6] for Q by at least 1. All operations take 0(1) worst case I/Os. We also show that every 
operation takes 0(1/6) amortized I/Os, where 1 < 6 < B. 

FindMin(Q) returns the value min(first(C(Q))). 

DeleteMin(Q) removes element e = min(first(C(Q))) from record (l,p) = first(C(Q)). After the removal, 
if |/| < 6 and \Q\ > b hold, we do the following. If 6 < |first(rest(C(Q)))| < 26, then we merge first(C(Q)) 
with first(rest(C(<5))) into one record which is the new first record. Else if 26 < |first(rest(C(<5)))| < 36 
then we take 6 elements out of first(rest(C(<5))) and put them into first(C(Q)). Else we have that 36 < 
|first(rest(C(Q)))|, and as a result we take 26 elements out of first(rest(0(<5))) and put them into first(C(<5)). 
If the inequality for Q is aggravated by 1 we call Bias(Q) once. Finally, element e is returned. 
Amortization: Only if the size of first(C(<5)) becomes |first(C(Q))| = 6 — 1 do we incur any I/Os. In this 
case r — first(Q) has a potential of $f(kl) = 2, and since we increase the number of elements in r by 6 to 26 
elements, the potential of r will then only be $ F (|r|) = 1. Thus, the total potential decreases by 1, which 
also pays for any I/Os including those incurred if Bias(Q) is invoked. 

CatenateAndAttrite(<3i, Q2) concatenates Q2 to the end of Qi and removes the elements from Q\ with 
value larger than min(Q2)- To do so, it creates a new I/O-CPQA Q[ by modifying Q\ and Q2, and by 
calling BiAS(Qi) and Bias(Q 2 )- 

If |Qi| < 6, then Qi is only one record (Zi, •), and so we prepend it into the first record (I2, •) = first(Q2) 
of Q 2 - Let l[ be the non-attrited elements of l\. We perform the prepend as follows. If |^| + | | < 46, then 
we prepend l[ into 12- Else, we take 26 — |Z^| elements out of Z2, and make them along with l' x the new first 
record of Q2- 

Amortization: If we simply prepend l[ into Z2, then the potential $<?(|Zi|) pays for the increase in potential 
of $i?(|first(C(Q2))|)- Else, we take 26— |Z^| elements out of I2, and these elements along with l[ become the 
new first record of Q2 of size 26. Thus, $f(26) = 1 and the potential drops by 1, which is enough to pay for 
the I/Os used to flush the old first record of C{Q2) to disk. 

If IQ2I < 6, then Q2 only consists of one record. We have two cases, depending on how much of Q\ is attrited 
by Q2- Let r% be the second last record for Q\ and let r 2 = last(Qi) be the last record. If e attrites all of 
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r%, then we just pick the appropriate case among ([I]j4]) below. Else if e attrites partially r%, but not all of it, 
then we delete T2 and we merge T\ and Q2 into the new last record of Q\ 1 which cannot be larger than 56. 
Otherwise if e attrites partially T2, but not all of it, then we simply append the single record of Q2 into r2, 
which will be the new last record of Q\ and it cannot be larger than 56. 

Amortization: If e attrites all of r±, then we release at least 1 in potential, so all costs in any of the cases 
([TJQ are paid for. If e attrites partially 7*1, then the new record cannot contain more than 56 elements, 
and thus any increase in potential is paid for by the potential of Q^- Thus, the I/O cost is covered by the 
decrease of 1 in potential, caused by r±. If e attrites partially r 2 , any increase in potential is paid for by the 
potential of Q2 ■ 

We have now dealt with the case where Qi is a small queue, so in the following we assume that Qi is large. 
Let e = min(<52)- 

1) If e < min(first(C(Qi))), we discard I/O-CPQA Q 1 and set Q[ = Q 2 . 

2) Else if e < max(last(C(Qi))), we remove the simple record (I,-) = first(C(Q2)) from C(Q2), we set 
C(Q[) — 0, B(Q' 1 ) — C(Qi) and Di(Q' 1 ) = (l,p), where p points to Q 2 , if it exists. This aggravates the 
inequality for Q2 by at most 1, and gives A(Q' 1 ) = —1. Thus, we call Bias(Q2) once and Bias(Q' 1 ) once. 

3) Else if e < min(first(S((5i))) or e < min(first(£>i((3i))) holds, we remove the simple record (I,-) — 
first(C(Q2)) from C(Q2), set Di(Q[) = (l,p), and makep point to Q2, if it exists. If e < min(first(£>((3i))), 
we set B{Q' 1 ) = 0. This aggravates the inequality for Q 2 by at most 1, and aggravates the inequality 
for Qx by at most 1. Thus, we call Bias(Q2) once and Bias(Q' 1 ) once. 

4) Else, let (Z 1( -) = last(D fcQi ). We remove (/ 2 ,-) = first (C(Q 2 )) from C(Q 2 )- If \h\ < 6, then remove 
the record (h,-) from Dk Qi - Let l[ be the non-attrited elements under attrition by e = min^). If 
l^'il + < 46, then we prepend l[ into I2 of record T2 = (h,P2), where P2 points to Q2- Otherwise, 
we make a new simple record r\ with l[ and 26 elements taken out of r 2 = (^2,^2)- Finally, we put the 
resulting one or two records r\ and T2 into a new deque Dfc Qi +i(Qi)- This aggravates the inequality 
for Q 2 by at most 1, and the inequality for Qi by at most 2. Thus, we call Bias(Q2) once and Bias(Q 1 ) 
twice. 

Amortization: In all the cases both Qi and Q2 are large, hence when we concatenate them we decrease 

the potential by at least 1, as the number of large I/O-CPQA's decrease by one which is enough to pay for 
any Bias operations. 

Insert AndAttrite(Q, e) inserts an element e into I/O-CPQA Q and attrites the elements in Q with value 
larger than e. This is a special case of operation CatenateAndAttrite(<5i,(52), where Qi = Q and Q 2 is 
an I/O-CPQA that only contains one record with the single element e. 

Amortization: Since creating a new I/O-CPQA with only one element and calling CatenateAndAttrite 
only costs 0(1/6) I/Os amortized, the operation InsertAndAttrite also costs 0(1/6) I/Os amortized. 

C(Q) D X {Q) 

™/ ::=l ™3 n 

/ C(Q') B(Q') Dx{Q') n, :: AO') 

Figure 2: In the case of Bias(Q), where B{Q) = and fcg = 1, we need to follow the pointer p of (Up) — 
first(I?i(Q)) that may point to an I/O-CPQA Q' . If so, we merge it into Q, taking into account attrition 
of Q' by e = min(first(£>i(Q))). 
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Bias(Q) improves the inequality in l{6]for Q by at least 1. 

Amortization: Since all I/Os incurred by Bias(Q) are already paid for by the operation that called BlAS(Q), 
we only need to argue that the potential of Q does not increase due to the changes that Bias(Q) makes 
to Q. 

1) \B{Q)\ > 0: We remove the first record first(B(Q)) = (h,-) from B(Q) and let (l 2 ,p 2 ) = first(Di(Q)). 
Let l[ be the non-attritcd elements of li under attrition from e = min(Z 2 )- 

1) < < 6: If \l 2 1 < 26, then we just prepend l[ onto l 2 . Else, we take 6 elements out of l 2 and 
append them to l[. 

2) b < < 2b: If \l 2 \ < 2b, and if furthermore + \l 2 \ < 36 holds, then we merge l[ and l 2 . 
Else \ + \l 2 \ > 36 holds, so we take 26 elements out of l[ and l 2 and put them into l[, leaving the 
rest in l 2 . 

Else \l 2 1 > 26 holds, so we take 6 elements out of l 2 and put them into l[. 

If we did not prepend l\ onto l 2 , we insert l[ along with any elements taken out of l 2 at the end of C(Q) 
instead. If < \h\, we set B(Q) = 0. Else, we did prepend l[ onto l 2 , and then we just recursively 
call Bias. Since |i?(Q)| =0we will not end up in this case again. As a result, in all cases the inequality 
of Q is improved by 1. 

Amortization: If l\ — first(Q), then after calling Bias we ensure that 26 < |first(Q)| < 36, and so the 
that potential of Q does not increase. 

2) = 0: When \B(Q) \ = holds, we have two cases depending on the number of dirty queues, namely 
cases k,Q > 1 and kq = 1. 

1) kQ > 1: Let e = min(first(£>fc Q (Q)))- If e < min(last(Z3fe Q _i((5))) holds, we remove the record 
last(Z?fe Q _i(Q)) from Dk Q ~i(Q). This improves the inequality of Q by 1. 

Else, if min(last(£>A;Q-i(Q))) < e < max(last(Z3fc Q _i(Q))) holds, we remove record r\ = (h,pi) = 
last(£>fe Q _i(Q)) from Dk Q -i(Q) and let r 2 = (l 2 ,p 2 ) = first {Dk Q (Q))- We delete any elements in l\ 
that are attrited by e, and let l[ denote the non-attrited elements. 

1) < < 6: If I/2I < 26, then we just prepend l[ onto l 2 . Otherwise, we take 6 elements out of l 2 
and append them to l[. 

2) If 6 < < 26: If \l 2 \ < 26 and + \l 2 \ < 36, then we merge l[ and l 2 . Else, + \l 2 \ > 36 holds, 
so we take 26 elements out of l' x and l 2 and put them into Z' 1; leaving the rest in l 2 . 

Else |^2 1 > 26, so we take 6 elements out of l 2 and put them into l[. 

If r\ still exists, we insert it in the front of Dk Q (Q). Finally, we concatenate Dk Q -i(Q) and Dk Q {Q) 
into one deque. This improves the inequality of Q by at least 1. 

Else max(last(-DfcQ-i(Q))) < e holds, and we just concatenate the deques Dk Q -i(Q) and Dk Q (Q), 
which improves the inequality for Q by 1. 

Amortization: If not all of l\ is attrited then we ensure that its record r\ has size between 26 and 36. 
Thus, if r\ — first(Q) holds, we will not have increased the potential of Q. In the cases where all or 
none of l\ is attrited, the potential of Q can only be decreased by at least 0. 

2) kQ = 1: In this case Q contains only deques C(Q) and D 1 (Q). We remove the record r — (l,p) = 
first(-Di((3)) and insert I into a new record at the end of C(Q). This improves the inequality of Q 
by at least 1. If r is not simple, let r's pointer p point to I/O-CPQA Q' . We restore I (5] for Q by 
merging I/O-CPQAs Q and Q' into one I/O-CPQA. See Figure [2] for this case of operation Bias. In 
particular, let e = min(first(Z?i((2))), we now proceed as follows: 

If e < min(Q'), we discard Q' . The inequality for Q remains unaffected. 

Else, if min(first(C(Q'))) < e < max(last(C(Q')), we set B(Q) = C(Q') and discard the rest of Q' . 
The inequality for Q remains unaffected. 
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Else if max(last(C(0/)) < e < min(first(Di(Q'))), we concatenate the deque C(Q') at the end of C(Q). 
If moreover min(first(i?((3'))) < e holds, we set B(Q) — B(Q'). Finally, we discard the rest of Q' . 
This improves the inequality for Q by |C(Q')|. 

Else min(first(£>i(0/))) < e holds. We concatenate the deque C(Q') at the end of C(Q), we set B(Q) = 
B(Q'), we set Di(Q'), . . . , Dk Q i (Q') as the first key dirty queues of Q and we set D\(Q) as the last 
dirty queue of Q. This improves the inequality for Q by A(Q') > 0, since Q' satisfied l|6] before the 
operation. 

If r = first (Q) and |Z| < 26, then we remove r and run Bias recursively. Let r' = (l',p') = first (Q). 
If \l\ + \l'\ > 36, then we take the 26 first elements out and make them the new first record of C(Q). 
Else we merge I into V , so that r is removed and r' is now first(Q). 

Amortization: Since first(Q) is either untouched or left with 2b to 36 elements, in which case its 
potential is 1, and since all other changes decrease the potential by at least 0, we have that Bias does 
not increase the potential of Q. 

Theorem 3.1. A set oft I/O-CPQA 's can be maintained supporting the operations FindMin, DeleteMin, 
CatenateAndAttrite and InsertAndAttrite in 0(1/6) I/Os amortized and 0(1) worst case I/Os per 
operation. The space usage is 0( 1! -^ R ) blocks after calling Catenate And Attrite and InsertAndAt- 
trite n times and DeleteMin m times, respectively. We require that M > lb for 1 < 6 < B , where M is 
the main memory size and B is the block size. 

Proof. The correctness follows by closely noticing that we maintain invariants l(T]-l|8j and from those we 
have that DeleteMin(Q) and FindMin(Q) always returns the minimum element of Q. 

The worst case I/O bound of 0(1) is trivial as every operation only touches 0(1) records. Although Bias 
is recursive, we notice that in the case where |-B(Q)| > 0, Bias only calls itself after making |-B(Q)| = 0, so 
it will not end up in this case again. Similarly, if |_B(Q)| = and /cq > 1 there might also be a recursive 
call to Bias. However, before the call at least 6 elements have been taken out of Q, and thus the following 
recursive call to Bias will ensure at least 6 more are taken out. This is enough to stop the recursion, which 
will have depth at most 3. 

The 0(1/6) amortized I/O bounds, follows from the potential analysis made throughout the description 
of each operation. □ 

3.2 Concatenating a Sequence of I/O-CPQAs 

We describe how to CatenateAndAttrite I/O-CPQAs Qi, Q 2 , . . . , Qt into a single I/O-CPQA in 0(1) 
worst case I/Os, given that DeleteMin is not called in the sequence of operations. We moreover im- 
pose two more assumptions. In particular, we say that I/O-CPQA Q is in state x G Z, if |C(Q)| = 
Si=i + kQ — 1 + x holds. Positive x implies that Bias(Q) will be called after the inequality for Q 

is aggravated by x + 1. Negative x implies that Bias(Q) need to be called x operations times in order to 
restore inequality for Q. So, we moreover assume that I/O-CPQAs Qi,i € [1,^] are at state at least +2, 
unless Qi contains only one record in which case it may be in state +1. We call a record r = (l,p) in an 
I/O-CPQA Qi critical, if r is accessed at some time during the sequence of operations. In particular, the criti- 
cal records for Q % are first(C(Q 2 )), first(rest(C(Qi))), last(0(Qi)), first (B(Qi)), first (Di(Qj)), last (D kQi (Qi)), 
and last(front(Z?fc Q (Qi))) if h exists. Otherwise, record last(Dfe Q ._i(<5i)) is critical. So, we moreover assume 
that the critical records for I/O-CPQAs Qi,i £ [1,^] are loaded into memory. 

The algorithm considers I/O-CPQAs Qi in decreasing index i (from right to left). It sets Q l = Qi and 
constructs the temporary I/O-CPQA Q 4 " 1 by calling CATENATEANDATTRlTE(Q i _ 1 ,(5 I ). This yields the 
final I/O-CPQA Q 1 . 

Lemma 3.1. I/O-CPQAs Q t ,i € [1,£] can be CatenateAndAttrite^ into a single I/O-CPQA without 
any access to external memory, provided that: 

1. Qi is in state at least +2, unless it contains only one record, in which case its state is at least +1, 
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2. all critical records of all Qi reside in main memory. 



Proof. To avoid any I/Os during the sequence of CatenateAndAttrites, we ensure that Bias is not 
called, and that the critical records are sufficient, and thus no more records need to be loaded into memory. 

To avoid calling Bias we prove by induction the invariant that the temporary I/O-CPQAs Q z ,i G [1,£] 
constructed during the sequence are in state at least +1. Let the invariant hold of Q %+1 and let Q l be 
constructed by CatenateAndAttrites^ ) . If Qi contains at most two records, which both reside 
in dequeue C{Q i ) ) we only need to access record first(C(<5 i+1 )) and the at most two records of Qi. The 
invariant holds for Q l , since it holds inductively for Q z+1 and the new records were added at C(Q l+1 ). As a 
result, the inequality of ijfijfor Q l+1 can only be improved. If Q l+1 consists of only one record, then either 
one of the following cases apply or we follow the steps described in operation CatenateAndAttrite. In 
the second case, there is no aggravation for the inequality of [6] and only critical records are used. 

In the following, we can safely assume that Qi has at least three records and its state is at least +2. We 
parse the cases of the CatenateAndAttrite algorithm assumming that e = min(Q' +1 ). 

Case 1 The invariant holds trivially since Qi is discarded and no change happens to Q l = Q l+1 . Bias is not 
called. 

Cases 2,3 The algorithm checks whether the first two records of C(Qi) are attrited by e. If this is the case, we 
continue as denoted at the start of this proof. Otherwise, case [2] of CatenateAndAttrite is applied 
as is. Q l+1 is in state after the concatenation and Q l is in state +1. Thus the invariant holds, and 
Bias is not. Note that all changes take place at the critical records of Qi and Q %+1 . 

Case 4 The algorithm works exactly as in case [4] of CatenateAndAttrite, with the following exception. At 
the end, Q l will be in state 0, since we added the deque Dk Qi+1 +i with a new record and the inequality 
of lj6]is aggrevated by 2. To restore the invariant we apply case 2(1) of Bias. This step requires access 
to records last(Dfe <3i _i) and first (Dk Qi )■ These records are both critical, since the former corresponds 
to last(Dfc Qi+1 ) and the latter to firstC(Q I+1 ). In addition, Bias(Q i+1 ) need not be called, since by 
the invariant, Q t+1 was in state +1 before the removal of firstC(Q l+1 ). In this way, we improve the 
inequality for Q l by 1 and invariant holds. 

□ 



4 Dynamic Planar Range Skyline Reporting 

In this Section we present dynamic I/O-efficient data structures that support 3-sided planar orthogonal range 
skyline reporting queries. 

3-Sided Skyline Reporting We describe how to utilize I/O-CPQAs in order to obtain dynamic data 
structures that support 3-sided range skyline reporting queries and arbitrary insertions and deletions of 
points, by modifying the approach of |14) for the pointer machine model. In particular, let P be a set of n 
points in the plane, sorted by x-coordinate. To access the points, we store their x-coordinates in an (a, 2a)- 
tree T with branching parameter a > 2 and leaf parameter k > 1. In particular, every node has degree 
within [a, 2a] and every leaf contains at most k consecutive by ^-coordinate input points. Every internal 
node u of T is associated with an I/O-CPQA whose non-attrited elements correspond to the maximal points 
among the points stored in the subtree of u. Moreover, u contains a representative block with the critical 



records of condition 2 in Lemma 3.1 for the I/O-CPQAs associated with its children nodes. 

To construct the structure, we proceed in a bottom up manner. First, we compute the maximal points 
among the points contained in every leaf of T. In particular for every leaf, we initialize an I/O-CPQA Q. 
We consider the points (p x ,p y ) stored in the block in increasing x-coordinate, and call InsertAndAt- 
trite(Q, — p y ). In this way, a point p in the block that is dominated by another point q in the block, is 
inserted before g in Q and has value —p y > —q y . Therefore, the dominated points in the block correspond 
to the attrited elements in Q. 
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We construct the I/O-CPQA for an internal node u of T by concatenating the already constructed 
I/O-CPQAs Qi at its children nodes Uj of u, for i £ [1, a] in Section [3| Then we call Bias to the resulting 



I/O-CPQA appropriately many times in order to satisfy condition 1 in Lemma 3.1 The procedure ends when 
the I/O-CPQA is constructed for the root of T. Notice that the order of concatenations follows implicitly 
the structure of the tree T. To insert (resp. delete) a point p = (p Xl p y ) to the structure, we first insert (resp. 
delete) p x to T. This identifies the leaf with the I/O-CPQA that contains p. We discard all I/O-CPQAs 
from the leaf to the root of T, and recompute them in a bottom up manner, as described above. 

To report the skyline among the points that lie within a given 3-sided query rectangle [xg, x r ] x [yb, +oo), 
it is necessary to obtain the maximal points in a subtree of a node u of T by querying the I/O-CPQA stored 
in u. Notice, however, that computing the I/O-CPQA of an internal node of T modifies the I/O-CPQAs of 
its children nodes. Therefore, we can only report the skyline of all points stored in T, by calling DeleteMin 
at the I/O-CPQA stored in the root of T. The rest of the I/O-CPQAs in T are not queriable in this way, 
since the corresponding nodes do not contain the version of their I/O-CPQA, before it is modified by the 
construction of the I/O-CPQA for their parent nodes. For this reason we render the involved I/O-CPQAs 
conflucntly persistent, by implementing their clean, buffer and dirty deques as purely functional catenable 
deques [TTJ. In fact, T encodes implicity the directed acyclic version graph of the confluently persistent I/O- 
CPQAs, by associating every node of T with the version of the I/O-CPQA at the time of its construction. 
Every internal node of T stores a representative block with the critical records for the versions of the I/O- 
CPQAs associated with its children nodes. Finally, the update operation discards the I/O-CPQA of a node 
in T, by performing in reverse the operations on the purely functional catenable deques involved in the 
construction of the I/O-CPQA (undo operation). 

With the above modification it suffices for the query operation to identify the two paths pi,p r from the 
root to the leaves of T that contain the x-successor point of xt and the a;- predecessor point of x r , respectively. 
Let R be the children nodes of the nodes on the paths pg and p r that do not belong to the paths themselves, 
and also lie within the query x-range. The subtrees of R divide the query arrange into disjoint x-ranges. 
We consider the nodes of R from left to right. In particular, for every non-leaf node in pg\Jp r , we load into 
memory the representative blocks of the versions of the I/O-CPQAs in its children nodes that belong to 
R. We call CatenateAndAttrite on the loaded I/O-CPQAs and on the resulting I/O-CPQAs for every 
node in pg U p r , as decribed in Section [3] The non-attrited elements in the resulting auxiliary I/O-CPQA 
correspond to the skyline of the points in the query x-range, that are not stored in the leaves of pg and p r . To 
report the output points of the query in increasing x-coordinate, we first report the maximal points within 
the query range among the points stored in the leaf of pg. Then we call DeleteMin to the auxiliary I/O- 
CPQA that returns the maximal points in increasing x-coordinate, and thus also in decreasing y-coordinate, 
and thus we terminate the reporting as soon as a skyline point with y-coordinate smaller than yt is returned. 
If the reporting has not terminated, we also report the rest of the maximal points within the query range 
that are contained in the leaf of p r . 

Theorem 4.1. There exist I/O-efficient dynamic data structures that store a set of n planar points and 
support reporting the t skyline points within a given 3-sided orthogonal range unbounded by the positive 
y-dimension in 0(log 2Bc n + tjB 1 ~ t ') worst case I/Os, and updates in 0(log 2Bc n) worst case I/Os, us- 
ing 0(nj B 1 ~ e ) disk blocks, for a parameter < e < 1. 

Proof. We set the buffer size parameter b of the I/O-CPQAs equal to the leaf parameter k of T, and we set 
the parameters a — 2B e and k — B 1_c for < e < 1. In this way, for a node of T, the representative blocks 
for all of its children nodes can be loaded into memory in 0(1) I/Os. Since every operation supported by 
an I/O-CPQA involves a 0(1) number of deque operations, I/O-CPQAs can be made confluently persistent 
without deteriorating their I/O and space complexity. Moreover, the undo operation takes 0(1) worst case 
I/Os, since the purely functional catenable deques are worst case efficient. 



Therefore by Theorem 3.1 an update operation takes 0(log 2Be B T_ 6 ) = 0(log 2BE n ) worst case I/Os. 



Lemma 3.1 takes 0(1) I/Os to construct the temporary I/O-C PQA s for every node in the search paths, 



since they satisfy both of its conditions. Moreover, by Theorem 3.1 it takes ( l °^fJ e - ) I/Os to catenate 



them together. Thus, the construction of the auxiliary query I/O-CPQA takes 0(log 2B£ n) worst case I/Os 
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in total. Moreover, it takes 0(1 + t/B 1 e ) worst case I/Os to report the output points. There are 0( ) 
internal nodes in T, and every internal node contains 0(1) blocks. □ 

4-Sided Skyline Reporting Dynamic I/O-cfhcient data structures for 4-sided range skyline reporting 

queries can be obtained by following the approach of Overmars and Wood for dynamic rectangular visibility 

queries [T3]. In particular, 4-sided range skyline reporting queries are supported in 0( log a a 1 °^ g ^ 1 B£ + t/B 1 ^'-) 

worst case I/Os, using 0( B "_ e log a n) blocks, by employing our structure for 3-sided range skyline reporting 

as a secondary structure on a dynamic range tree with branching parameter a, built over the y-dimension. 

1 2 

Updates are supported in 0( l °f a ™ 2Be ) worst case I/Os, since the secondary structures can be split or 
merged in 0(log 2 £ £ n) worst case I/Os. 

Remark 4.1. In the pointer machine, the above constructions attains the same complexities as the existing 
structures for dynamic 3-sided and 4-sided range maxima reporting J^j, by setting the buffer size, branching 
and leaf parameter to 0(1). 

5 Lower Bound for Dominating Minima Reporting 

Let S be a set of n points in K 2 . Let Q = {Qi} be a set of m orthogonal 2-sided query ranges Qi £ M. 2 . 
Range Qi is the subspace of M 2 that dominates a given point qi £ M 2 in the positive x- and y- direction (the 
"upper-right" quadrant defined by qi). Let Si = S fl Qi be the set of all points in S that lie in the range Qi. 
A dominating minima reporting query Qi contains the points min(S'i) £ Si that do not dominate any other 
point in Si. In this section we prove that any pointer-based data structure that supports dominating minima 
queries in O(log°^ n + t) time, must use superlinear space. This separates the problem from the easier 
problem of supporting dominating maxima queries and the more general 3-sided range skyline reporting 
queries. The same trade-off also holds for the symmetric dominated maxima reporting queries that are the 
simplest special case of 4-sided range skyline reporting queries that demands superlinear space. Moreover, 
the lower bound holds trivially for the I/O model, if no address arithmetic is being used. In particular, for 

a query time of ( los ^ — + -g) the data structure must definitely use ^(g i g^g W ) blocks of space. In the 
following, we prove the lower bound for the dominating minima reporting queries. 

Henceforth, we use the terminology presented in Section [2] Without loss of generality, we assume that 
n = lo x , since this restriction generates a countably infinite number of inputs and thus the lower bound is 
general. In our case, u = log 7 n holds for some 7 0, m = 2 and A = i +7 1 °o g " og „ ■ Let be the integer 

obtained by writing < i < n using A digits in base u>, by first reversing the digits and then taking their 
complement with respect to uj. In particular, if i = %q 'i^ • ■ ■ holds, then 

Pu (i) = - *t\ - 1)(« - A% - 1) ■ • . (a; - 4 W) - l)(w - tt ] 1) 

where is the j-th digit of number i in base u). We define the points of S to be the set {{i, p u (i))\0 < i < n}. 
Figure [3] shows an example with u> — 4, A = 2. 

To define the query set Q, we encode the set of points {p w (i)|0 < i < n} in a full trie structure of depth A. 
Recall that n — cj a . Notice that the trie structure is implicit and it is used only for presentation purposes. 
Input points correspond to the leaves of the trie and their y value is their label at the edges of the trie. 
Let v be an internal node at depth d (namely, v has d ancestors), whose prefix v 0l v\, . . . , w^-i corresponds 
to the path from v to the root r of the trie. We take all points in its subtree and sort them by y. From 
this sorted list we construct groups of size uj by always picking each w A ~ d-1 -th element starting from the 
smallest non-picked element. Each such group corresponds to the output of each query. See Figure [3] for an 
example. In this case, we say that the query is associated to node v. 

A node of with depth d has -\ points in its subtree and thus it defines at most "_ t queries. Thus, the 
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1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 

Figure 3: An example for lu = 4 and A = 2. Two examples of queries are shown, out of the 8 possible queries 
with different output. Connecting lines represent points whose L\ distance is Lu k , 1 < k < A. All 8 possible 
queries can be generated by translating the blue lines horizontally so that the answers of all 4 queries are 
disjoint. Similarly for the red lines with the exception that we translate them vertically. 

total number of queries is: 

A-1 A-1 , 

d=0 d=0 

This means that the total number of queries is 

.g. An logn 1 n 

lu 1 + 7 log log n log 7 n log 7-1 n(l + 7 log logn) 

The following lemma states that Q is appropriate for our purposes. 

Lemma 5.1. Q is (2, log 7 n)-favorable. 

Proof. First we prove that we can construct the queries so that they have output size u> = log 7 n. Assume 
that we take a group of lu consecutive points in the sorted order of points with respect to the y-coordinate 
at the subtree of node v at depth d. These have common prefix of length d. Let the y-coordinates of 
these points be Pui(ii), Pu(i2), ■ ■ ■ in increasing order, where — Pu{ij-i) = w -1 , 1 < j < u. 

This means that these numbers differ only at the A — d — 1-th digit. This is because they have a common 
prefix of length d since all points lie in the subtree of v. At the same time they have a common suffix of 
length A — d — 1 because of the property that p u (ij) — p u (ij-x) = w A ~ d ~\ 1 < j < uj which comes as a 
result from the way we chose these points. By inversing the procedure to construct these y-coordinates, 
the corresponding x-coordinates ij , 1 < j < u> are determined. By complementing we take the increasing 
sequence p u {ij), . . . ,Pu{h),Pu{h), where p u {ij) = u x -p u {ij)-l mApu{ij-i)-pu{ij) = ui x ~ d ~ 1 ) 1 < j < uj. 
By reversing the digits we finally get the increasing sequence of x-coordinates i u , . . . , 12, ix, since the numbers 
differ at only one digit. Thus, the group of lu points are decreasing as the ^-coordinates increase, and as 
a result a query q whose horizontal line is just below p u (ii) and the vertical line just to the left of p u {iu) 
will certainly contain this set of points in the query. In addition, there cannot be any other points between 
this sequence and the horizontal or vertical lines defining query q. This is because all points in the subtree 
of v have been sorted with respect to y, while the horizontal line is positioned just below p u {i\), so that no 
other element lies in between. In the same manner, no points to the left of Puj(i u ) exist, when positioning 
the vertical line of q appropriately. Thus, for each query q G Q, it holds that \S f~l q\ = to = log 7 n. 
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It is enough to prove that for any two query ranges p,q € Q, \S D q D p\ < 1 holds. Assume that p and q 
are associated to nodes v and u, respectively, and that their subtrees are disjoint. That is, u is not a proper 
ancestor or descendant of v. In this case, p and q share no common point, since each point is used only once 
in the trie. For the other case, assume without loss of generality that u is a proper ancestor of v (u ^ v). 
By the discussion in the previous paragraph, each query contains u> numbers that differ at one and only one 
digit. Since u is a proper ancestor of v, the corresponding digits will be different for the queries defined in u 
and for the queries defined in v. This implies that there can be at most one common point between these 
sequences, since the digit that changes for one query range is always set to a particular value for the other 
query range. The lemma follows. □ 



Lemma 5.1 allows us to apply Lemma 2.1 and thus the query time of C(log 7 n + t), for output size t, 



can only be achieved at a space cost of ft yn lo °f g „ J ■ The following theorem summarizes the result of this 
section. 

Theorem 5.1. The dominating minima reporting problem can be solved with Q ( n i g°iogn ) space, if the query 
is supported in C(log 7 n + t) time, where t is the size of the answer to the query and parameter 7 = 0(1). 



6 Conclusion 

We presented the first dynamic I/O-efficient data structures for 3-sided planar orthogonal range skyline 
reporting queries with worst case polylogarithmic update and query complexity. We also showed that the 
space usage of the existing structures for 4-sided range skyline reporting in pointer machine is optimal within 
doubly logarithmic factors. 

It remains open to devise a dynamic I/O-efficient data structure that supports reporting all m planar 
skyline points in 0(m/B) worst case I/Os and updatess in 0(log B n) worst case I/Os. It seems that the 
hardness for reporting the skyline in optimal time is derived from the fact that the problem is dynamic. 
The dynamic indexability model of Yi |21) may be useful to prove a lower bound towards the direction of 
rendering our structure for 3-sided range skyline reporting I/O-optimal, as defined by Papadias et al.|16). 
Finally it remains open to obtain a 0(|| log B n) space dynamic I/O-efficient data structures for 4-sided range 
skyline reporting with 0(log^n) worst case query and update I/Os, regardless of the I/O-complexity per 
reported point. 
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